Multimodal Approach toward Intelligent Video Production
نویسنده
چکیده
This paper presents the framework of our intelligent video production, which has functions for multimodal data acquisition, accumulation to a database, and intelligent video editing. Since multimodal approach is most e ective and promising for semi-/automatic video production, we rst developed a prototype of multimodal acquisition system and database in which human behaviors are recorded in a multimodal way. By consulting it, we can intensively investigate the human behaviors in typical situations. We are developing algorithms for detecting typical human behaviors, for example, deictic movements. The behaviors are good keys to detecting the \focus", which means the most important portions to be recorded and to be presented to the audience. Thus, intelligent video production becomes possible by these intelligent processes. We brie y introduce, in this paper, our idea for detecting the focus of human behaviors in videotaping and emphasizing it.
منابع مشابه
Towards an intelligent framework for multimodal affective data analysis
An increasingly large amount of multimodal content is posted on social media websites such as YouTube and Facebook everyday. In order to cope with the growth of such so much multimodal data, there is an urgent need to develop an intelligent multi-modal analysis framework that can effectively extract information from multiple modalities. In this paper, we propose a novel multimodal information e...
متن کاملA Multimodal Approach toward Teaching for Transfer: A Case of Team-Teaching in ESAP Writing Courses
This paper presents a detailed examination of learning transfer from an English for Specific Academic Purposes course to authentic discipline-specific writing tasks. To enhance transfer practices, a new approach in planning writing tasks and materials selection was developed. Concerning the conventions of studies in learning transfer that acknowledge different learning preferences, the instruct...
متن کاملAchieving Multimodal Cohesion during Intercultural Conversations
How do English as a lingua franca (ELF) speakers achieve multimodal cohesion on the basis of their specific interests and cultural backgrounds? From a dialogic and collaborative view of communication, this study focuses on how verbal and nonverbal modes cohere together during intercultural conversations. The data include approximately 160-minute transcribed video recordings of ELF interactions ...
متن کاملThe impact of proactive and reactive focus on form in multimodal settings on EFL learners' comprehension and production of modal auxiliaries
The major objective of this mixed methods research, which considered elements of both quantitative and qualitative research approaches, was to examine the effect of two different types of focus on form instruction, namely proactive and reactive across multimodal vs. traditional input settings on Iranian EFL learners' comprehension and production of modal auxiliaries. The participants of the stu...
متن کاملMultimodal Computing and Interaction – Robust, efficient, and intelligent processing of text, speech and visual data
The abundance of widely available digital data poses exciting new opportunities. Previously this data was mostly textual, today it includes text, speech, audio, images, video, and other representations. The challenge now is to organize, understand, and search this multimodal information in a robust, efficient and intelligent way, and to create dependable systems that allow natural and intuitive...
متن کامل